Picture for Huiyu Duan

Huiyu Duan

Affiliation 1

LL-Bench: Rethinking Low-Level Vision Evaluation in the Era of Large-Scale Generative Models

Add code
Jun 01, 2026
Viaarxiv icon

GeoR-Bench: Evaluating Geoscience Visual Reasoning

Add code
May 12, 2026
Viaarxiv icon

DynT2I-Eval: A Dynamic Evaluation Framework for Text-to-Image Models

Add code
May 07, 2026
Viaarxiv icon

LoViF 2026 The First Challenge on Holistic Quality Assessment for 4D World Model (PhyScore)

Add code
May 06, 2026
Viaarxiv icon

Market-Bench: Benchmarking Large Language Models on Economic and Trade Competition

Add code
Apr 07, 2026
Viaarxiv icon

ITIScore: An Image-to-Text-to-Image Rating Framework for the Image Captioning Ability of MLLMs

Add code
Apr 04, 2026
Viaarxiv icon

Preference-Guided Debiasing for No-Reference Enhancement Image Quality Assessment

Add code
Mar 20, 2026
Viaarxiv icon

Evaluating Image Editing with LLMs: A Comprehensive Benchmark and Intermediate-Layer Probing Approach

Add code
Mar 20, 2026
Viaarxiv icon

EditHF-1M: A Million-Scale Rich Human Preference Feedback for Image Editing

Add code
Mar 16, 2026
Viaarxiv icon

Learning to Wander: Improving the Global Image Geolocation Ability of LMMs via Actionable Reasoning

Add code
Mar 11, 2026
Viaarxiv icon